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Abstract 

We explore the possibility of applying the framework of frequent pat- 
tern mining to a class of continuous objects appearing in nature, namely 
knots. We introduce the frequent knot mining problem and present a so- 
lution. The key observation is that a database consisting of knots can be 
transformed into a transactional database. This observation is based on 
the Prime Decomposition Theorem of knots. 

1 Introduction 

Many algorithms have recently been developed for mining frequent patterns. 
Traditionally, these patterns consist of subsets of attributes in a relational 
database p. Recently, other patterns have been mined, such as trees [57] and 
graphs |l.'il 1141 1251 HOj . However, most objects appearing in nature lack the 
discrete character of graph and trees. In this paper we explore the possibility of 
applying the framework of frequent pattern mining to a class of continuous ob- 
jects appearing in nature, namely knots. A knot can be thought of as a piece of 
rope (where the rope has zero thicknes) which forms a loop in three-dimensional 
Euclidean space R 3 . Figure H shows an example of a knot known as the Trefoil 
knot. 

The history of knots dates back to the late 1800's when Lord Kelvin sug- 
gested that atoms where knots in an invisible and frictionless fluid. Since 
then, theoretical properties of knots are extensively studied in mathematics [3]. 
In physics, knot invariants (e.g., the Jones polynomial) are used in statistical 
physics ]F2\ an d knots also appear in the context of quantum gravity [2]. Re- 
cently, knots showed up as building blocks for future quantum computers [T5*] . 

In biology, knots are used to characterize topoisomerase enzymes |22| and 
in polymer science, physical properties of long ring polymers, such as DNA, 
gels and rubbers are related to properties of knots ^|SJ|^]. It is shown that 
knots are present in such polymers with probability one when the polymers are 
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long enough |2T]. There is also much interest in developing artificial knotted 
biopolymers as building blocks for DNA-based computing |18|. 

The study of knotted polymers is done both by using experimentally ob- 
tained knots and by using knots obtained by numerical approaches based on 
self-avoiding random- walk simulations. Examples of questions one would like to 
answer in these studies are what is the probability of having a certain knot in 
polymers of a certain length |19| . and whether the knots appear tight or loose 
in the knotted polymers [TT] . 

In this article we consider the frequent knot mining problem which can be 
stated as follows: Given a collection of knots, find all subknots which appear 
frequently in this collection. 

We believe that finding frequent subknots in a large collection of real or 
simulated knotted polymers, will contribute to a deeper understanding of the 
statistical properties of knotted polymers in K 3 . This article reports a first 
attempt for solving the frequent knot mining problem. 

The solution presented in this article consists of three steps: 

1. Encoding of knots in transactions; 

2. Mining these transactions; and finally, 

3. Decoding of the frequent itemsets into knots. 

The article is organized as follows: In Section 2, definitions are given and the 
frequent knot mining problem is stated formally. The encoding (decoding) of 
knots (transactions) into transactions (knots) is described in Section 3 . In Sec- 
tion 4, we present the KnotMiner algorithm for mining frequent knots. Finally, 
conclusions are drawn in Section 5. 

2 Preliminaries 

A knot K can be thought of as a piece of rope (where the rope has zero thickness) 
which forms a loop in three-dimensional Euclidean space R 3 . Two knots K and 
K' are equivalent, or in symbols K = K\ if they can be transformed into each 
other without cutting and pasting the ropes. We will only consider so-called 
tame knots. These are knots which are equivalent to piecewise linear knots, i.e., 




Figure 1: Trefoil knot. 
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Figure 2: Sum of two knots. 



knots consisting of a finite number of straight lines. A knot is trivial if it is 
equivalent to a rope which forms a circle in a plane in R. . 

A knot can be finitely represented by a knot diagram. The knot diagram of a 
knot if is a connected undirected planar graph, which correspond to a (generic) 
projection of A onto a plane. Vertices in a knot diagram correspond to places 
where the projection of the knot intersects, and each edge adjacent to a vertex is 
labelled as an undercrossing or overcrossing, whichever is the case. Given a knot 
diagram consisting of n vertices one can find in time polynomial in n a piecewise 
linear knot such that its ^-projection gives the original diagram 0. Two knot 
diagrams can be transformed into each other using the so-called Reidemeister 
moves if and only if they represent equivalent knots |3| . 

The connected sum of two knots Ki and A 2 is formed by removing a small 
piece of rope from both knots and then connecting the four endpoints by two 
new pieces of rope in such a way that no new crossings are introduced, the 
result being a a single knot, which is denoted by K = K\#K%. This operation 
is illustrated in Figure The connected sum K\^Ki is equivalent to A 2 #Ai 
and {K\#K%)#K3 is equivalent to Ai#(A 2 #A3). The connected sum of a 
knot K and the trivial knot is equivalent to K [3]. 

A knot is called 'prime if for any decomposition as a connected sum, one of 
the factors is the trivial knot. There are infinitely many prime knots. 

Theorem 1 (Prime decomposition Theorem |3l I17p Every knot K can 
be decomposed as a connected sum of nontrivial prime knots. If K = Ai^A^ 
■ ■ ■ #K m and A = L\j^Lij^ ■ ■ ■ #A rl7 where Ki and Li are nontrivial prime 
knots, then m = n, and after reordering each Ki is equivalent to Li. 

This theorem motivates the following definition. Let 



Then, A is a subknot of L, or K < A, if for any i = 1, . . . ,p we have that 



A = A 1 #A 2 #---#A ) 
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and let 



A — Ai#A 2 # • ■ • #A g . 



\{j\K j =K i }\<\{j\L j =K i ) 
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A knot datatabase P is a finite collection of knots. The support of a knot K 
in V is defined as 

supp(_ftT) = \{LeV\ K < L}\. 

The frequent knot mining problem can be stated as follows: Given a knot 
database V and a threshold value a € N, find all knots K in V such that 
supp(iC) > a. 

For completeness, we also state the frequent itemset mining problem. A 
transaction database T is a finite collection of fc-tuples in N fe . The support of 
an itemset / in T is defined as 

supp(J) = |{J e T | W : (T)e < (J)t}\, 

where (resp. (J)t) denotes the £th component of I (resp. J). The frequent 
itemset mining problem is then: Given a transaction database T and a threshold 
value cr G N, find all itemsets I in T such that supp(J) > a. 

3 From Knot Databases to Transaction Databases 

In this section we show how to transform a knot database T> into a transactional 
database. We assume that the knots in V are represented by knot diagrams. 

We start by computing for each knot K in T> its prime decomposition. Schu- 
bert |17j gives an algorithm computing this decomposition. The running time is 
at worst exponential in the number of vertices in the knot diagram. In this way, 
we obtain a set primes(2?) consisting of knot diagrams for all prime knots oc- 
curring in T>. Two different knot diagrams in primes(2?) can represent the same 
prime knot, so we have to eliminate duplicates. There exists an algorithm for 
testing whether two knot diagrams represent equivalent knots [HI 123) . However, 
at present, the complexity of this algorithm is not known. From here on, we 
assume that primes(£>) does not contain duplicates and order it arbitrarily. 

We now define a mapping, denoted by encode, from knots in a knot database 
to elements in a transaction database. Let T> be a knot database, and K a knot 
in T>. Then, 

encode(i^) = (m, . . . , n p ), 

withp = |primes(X>)| and rij is the number of times the prime knot corresponding 
to the ith knot diagram in primes(2?) appears in the prime decomposition of K. 
Clearly, encode(2?) is a transaction database consisting of |primes(£>)| attributes. 

Given a set of knots K, — {Ki, . . . , K p }, we now define the mapping, denoted 
by decode, from itemsets of a transaction database T C N p to knots in M 3 . Let 
teT and let (i)^,...,^ = (mi, . . . , m^) be a fc-itemset. Then, 

decode^, ...,m k ) = K il #>-- #K h # • • • # K ih # ■ ■ ■ #K lk . 

S v ' V v ' 

mi times times 

For any knot K, we have that K = decode(encode(X)). 
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4 Algorithm 



We now present The KnotMiner algorithm for computing the frequent knots in 
a knot database. 



Algorithm 1: KnotMiner 
Input: knot database T>,a 
Output: All knots K such that supp(K) > a. 

1: Compute T := encode(2?) 

2: Compute the set T of frequent itemsets in T 

3: Output decode(JF). 



The first and last step in KnotMiner arc already fully explained in Section 
III. For the second step one can either transform T into a binary transaction 
database and use a standard mining algorithm like Apriori [T], Eclat or FP- 
growth 0. Alternatively one can mine T directly using algorithms presented 
in [201 and [T2|. The following result is immediate. 

Theorem 2 The KnotMiner algorithm works correctly. 

5 Concluding Remarks and Future Work 

In this article we introduced the frequent knot mining problem and proposed 
the KnotMiner algorithm to solve it. Currently, there exists no implementation 
of KnotMiner. This is mainly due to the complex algorithms needed for the 
encoding of a knot database into a transactional databases. 

However, recent research indicates that the knot decomposition of knotted 
polymers can be obtained by "Coulomb decomposition" , which is a technique 
where polymers are brought into an equilibrium state using Coulomb interac- 
tions UJ- We hope to apply this technique on simulated knotted polymers and 
hence obtain an implementation of KnotMiner, specifically aimed for mining 
knotted polymer databases. 
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